SQLSAM: SQL for statistical analysis and modeling
نویسندگان
چکیده
Abstrait Statistical modeling and analysis is extensively used in businesses for various purposes including graphic visualization of data, measurement of central tendencies and other statistics, and inferences on populations based on samples. Data are the fundamental component of each of these activities. In this paper an extension of the standard database language SQL for statistical modeling and analysis is presented. Models covered include descriptive analytic and graphic measures, discrete probability distributions, continuous probability distributions, inferential statistics, and regression analysis. Through the use of SQLSAM, seamless integration of existing data in organizations’ databases and their statistical analysis can be achieved.
منابع مشابه
Splash: Integrated Ad-Hoc Querying of Data and Statistical Models
This paper presents a system called Splash, which integrates statistical modeling and SQL for the purpose of adhoc querying and analysis. Splash supports a novel, simple, and practical abstraction of statistical modeling as an aggregate function, which in turn provides for natural integration with standard SQL queries and a relational DBMS. In addition, we introduce and implement a novel repres...
متن کاملWhat is the Population of Interest: Population Modeling for BayesDB
BayesDB [1, 2] is a probabilistic programming platform that enables users to solve probabilistic data analysis problems using a simple, SQL-like language. Queries execute against generative population models (GPMs), a new abstraction that can be used to integrate data, metadata, qualitative domain knowledge, and quantitative models. Baseline quantitative models are typically built via an AI mod...
متن کاملModeling of Banks Bankruptcy in Iran (Multivariate Statistical Analysis)
In this paper we construct a modeling for detection of banks which are experiencing serious problems. Sample and variable set of the study contains 30 banks of Iran during 2006-2014 and their financial ratios. Well known multivariate statistical technique (principal component analysis) was used to explore the basic financial characteristics of the banks, and discriminant Logit and Probit models ...
متن کاملBayesDB: A probabilistic programming system for querying the probable implications of data
Is it possible to make statistical inference broadly accessible to non-statisticians without sacrificing mathematical rigor or inference quality? This paper describes BayesDB, a probabilistic programming platform that aims to enable users to query the probable implications of their data as directly as SQL databases enable them to query the data itself. This paper focuses on four aspects of Baye...
متن کاملStatistical Shape Modeling of Musculoskeletal Structures and Its Applications
Statistical shape models (SSM) describe the shape variability contained in a given population. They are able to describe large populations of complex shapes with few degrees of freedom. This makes them a useful tool for a variety of tasks that arise in computer-aided medicine. In this chapter we are going to explain the basic methodology of SSMs and present a variety of examples, where SSM has ...
متن کامل